Diverse Dialogue Generation by Deep Learning Methods Using Loss Function IncorporatingWord Statistics
نویسندگان
چکیده
In recent years, there has been a lot of research on building dialogue systems using deep learning, which can generate relatively fluent response sentences to user utterances. Nevertheless, they tend produce responses that are not diverse and less context-dependent. Assuming the problem is caused by Softmax Cross- Entropy (SCE) loss, treats all words equally without considering imbalance in training data, loss function Inverse Token Frequency (ITF) multiplies SCE weight based inverse token frequency, was proposed confirmed improvement diversity. However, diversity sentences, it necessary consider only information independent tokens, but also frequency incorporating sequence tokens. Using frequencies incorporate tokens compute weights dynamically change depending context, we better represent seek. Therefore, propose function, N-gram (INF) weighted n-gram instead order confirm effectiveness method INF conducted metric-based human evaluations automatically generated models trained Japanese English Twitter datasets. evaluation, Perplexity, BLEU, DIST-N, ROUGE, length were used as evaluation indices. assessed coherence sentences. model achieved higher scores ROUGE than previous methods. showed superior values.
منابع مشابه
Forecasting the Tehran Stock market by Machine Learning Methods using a New Loss Function
Stock market forecasting has attracted so many researchers and investors that many studies have been done in this field. These studies have led to the development of many predictive methods, the most widely used of which are machine learning-based methods. In machine learning-based methods, loss function has a key role in determining the model weights. In this study a new loss function is ...
متن کاملDeep Reinforcement Learning for Dialogue Generation
Recent neural models of dialogue generation offer great promise for generating responses for conversational agents, but tend to be shortsighted, predicting utterances one at a time while ignoring their influence on future outcomes. Modeling the future direction of a dialogue is crucial to generating coherent, interesting dialogues, a need which led traditional NLP models of dialogue to draw on ...
متن کاملDeep Active Learning for Dialogue Generation
We propose an online, end-to-end, neural generative conversational model for opendomain dialogue. It is trained using a unique combination of offline two-phase supervised learning and online human-inthe-loop active learning. While most existing research proposes offline supervision or hand-crafted reward functions for online reinforcement, we devise a novel interactive learning mechanism based ...
متن کاملPerceptual audio loss function for deep learning
PESQ, Perceptual Evaluation of Speech Quality [5], and POLQA, Perceptual Objective Listening Quality Assessment [1] , are standards comprising a test methodology for automated assessment of voice quality of speech as experienced by human beings. The predictions of those objective measures should come as close as possible to subjective quality scores as obtained in subjective listening tests, us...
متن کاملDeep Learning for Dialogue Systems
In the past decade, goal-oriented spoken dialogue systems have been the most prominent component in today’s virtual personal assistants. The classic dialogue systems have rather complex and/or modular pipelines. The advance of deep learning technologies has recently risen the applications of neural models to dialogue modeling. However, how to successfully apply deep learning based approaches to...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Transactions of The Japanese Society for Artificial Intelligence
سال: 2022
ISSN: ['1346-0714', '1346-8030']
DOI: https://doi.org/10.1527/tjsai.37-2_g-l62